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DISCRETE FILTER 

BACKGROUND 
The invention generally relates to a discrete filter. 

Electronic filters may be used in a wide range of applications. As examples, filters may 
be used in digital imaging applications to filter out noise, and filters may be used in wireless 
communication applications to reduce inter-symbol interference and aliasing effects. One general 
type of filter is a discrete, or digital filter, that may form the basis of most digital signal 
processing (DSP) operations. 

The digital filter may be a finite impulse response (FIR) filter, a filter in which the 
impulse response of the filter is zero outside of a predefined interval; or an infinite impulse 
response filter (IIR), a filter in which the impulse response of the filter does not fall to zero 
outside of a predefined interval. As an example, the FIR filter receives indications of sampled 
values (called x()) of an input signal to produce digitized values (called y(i)) of the filtered output 
signal, as described by the following equation: 

MO = Z C(j) ■ x(i + j) = C(0) + X {C(j) ■ x(i + j) + C(-j) ■ x{i - j)} , Eq. (1) 

j=-N 1=1 

where "C(j)" represents a particular filter coefficient and "j" denotes the index value used in the 
summation. The number of coefficients (2N+1) in Eq. 1 equals the number of taps of the filter. 
The filter may be further classified as being symmetric or asymmetric. For a symmetric filter, 
the C(j) coefficients are symmetric about j=0. Thus, C(-n)=C(n) for a symmetric filter. 
Otherwise, the filter is asymmetric. 

A conventional filter may use a general purpose DSP processor to perform the summation 
described by Eq. 1 to derive each y(i) value. In this manner, the DSP processor may include a 
multiplier-accumulator (MAC) unit that uses a multiple number of processing clock cycles (one 
for each multiplication, for example) to derive each y(i) value. Unfortunately, the processing 
time of this arrangement may be too long for some applications. Alternatively, an arrangement 
called a tapped delay line may be used to produce the y(i) values. The tapped delay line includes 
serially coupled processing stages, and each processing stage is associated with one of the C(j) 
coefficients. Unfortunately, a conventional tapped delay line is specifically designed for a 
specific number of taps, a feature that limits the number of filter designs that may use this delay 
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line. Furthermore, due to its size, the tapped delay line may occupy a large semiconductor area 
and may consume a considerable amount of power. 

Thus, there is a continuing need for an arrangement that addresses one or more of the 
problems that are stated above. 

5 

SUMMARY 

In one embodiment of the invention, a digital signal processing circuit includes a 
processing chain that has a selectable number of taps and a tap selection circuit. The tap 
selection circuit is coupled to the processing chain to establish the number of taps of the chain. 
10 Advantages and other features of the invention will become apparent from the following 

description, drawing and claims. 

i BRIEF DESCRIPTION OF THE DRAWING 

ijj Fig. 1 is a schematic diagram of a digital filter system according to an embodiment of the 

!"15 invention. 

U Fig. 2 is a schematic diagram of the processing chain of Fig. 1 according to an 

SJ 

embodiment of the invention. 
f7 Fig. 3 is a schematic diagram of a processing unit of the chain of Fig. 2 according to an 

Q embodiment of the invention. 

[£20 Fig. 4 is a more detailed schematic diagram of the digital filter system of Fig. 1 according 

^ to an embodiment of the invention. 

Fig. 5 is a schematic diagram of a unit that may be replicated to form a processing chain 
according to another embodiment of the invention. 

25 DETAILED DESCRIPTION 

Referring to Fig. 1, an embodiment 5 of a digital filter system in accordance with the 
invention includes a systolic processing chain 10 that has a selectable number of taps. Due to 
this architecture, the system 5 may be used to form a wide range of digital filters, as the number 
of taps may be selected by a programmable tap selection circuit 12 that is coupled to the 

30 processing chain 10. As further described below, output terminals 1 1 of the processing chain 10 
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may indicate a filtered output value (called y(i)) on each cycle of a clock signal (called CLKj), a 
feature that results in 100% throughput for the processing chain 10. 

As an example, in some embodiments, the digital filter system 5 may be used to 
implement a symmetric finite impulse response (FIR) filter. Due to the symmetry (C(n) = C(-n)) 
5 of the filter coefficients for this type of filter, Eq. 1 may be simplified, as described by the 
following equation: 



In Eq. 2, each "x()" denotes a particular input value, "CO" represents a particular filter 
coefficient and "y(i)" represents a particular filter output. It may be noted from Eq. 2 that 
10 although the number of taps of the filter is 2N+1, the number of filter coefficients (i.e., C(0), 
Q C(l), C(2), . . . C(N)) that are used by the filter is equal to N+l. Thus, except for the C(0) filter 

coefficient (that is associated with one tap of the filter), each C(j) filter coefficient is associated 

_3 
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Ly with two taps of the filter due to the symmetry. For example, N equals three for a seven tap 

l~i filter, and as an example, a particular output value y(10) for the seven tap filter may be described 

p~ E 15 by the following equation: 



Thus, except for the C(0) coefficient, each C(j) coefficient is multiplied by a pair of input values 
(i.e., x(i-j) + x(B-j)). Referring to Fig. 2, in some embodiments, these multiplications may be 

20 performed by N+l processing units 20 (processing units 20 0 , 20 b . . ., 20 k , . . . 20 n _ l5 20 n , as 
examples) of the chain 10, each of which exploits the symmetric property of the filter by 
multiplying a different C(j) coefficient by the appropriate pair of input values. As further 
described below, the processing units 20 form a systolic architecture, an architecture in which all 
of the processing units 20 are producing products on each clock cycle of the CLK, signal so that 

25 the chain 10 produces a different output value on each clock cycle. 

More particularly, the processing units 20 are serially coupled together to form a serial 
chain for forming the output values, a chain in which the processing occurs from the processing 
unit 20n to the processing unit 20 0 . Each processing unit 20 is associated with a different filter 
coefficient (i.e., each processing unit 20 is associated with two taps of the filter) and generates a 

30 corresponding product for each output value. In this manner, each processing unit 20 receives an 




Eq. (2) 



XI 0) = C(3) • {x(9) + jc(13)} + C(2) • {x(8) + x(12)} + C(l) • {jc(9) + x(l 1)} + C(0) • jc(10) 



Eq. (3) 
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indication (via accumulation input lines 102) of an ongoing sum from the predecessor processing 
unit 20 (except for the first processing unit 20n) in the chain 10, updates the ongoing sum with an 
additional product and furnishes an indication of the ongoing sum (via accumulation output lines 
1 10) to the successor processing unit 20 (except for the last processing unit 20 0 ) in the chain 10. 
5 Referring to Fig. 3, as an example, a particular processing unit 20k receives three input 

signals that indicate three respective values: p(k), the broadcast input value (from a broadcast 
input line 205) that is equivalent to some x value; r (k+1), a delayed input value indicated by the 
predecessor processing unit 20k+l (not shown) in the chain 10; and q(k+l), an ongoing sum 
value indicated by the predecessor processing unit 20k+l in the chain 10. The processing unit 
10 20k furnishes two output signals (to the successor processing unit 20k- 1 (not shown)) that 

indicate two respective values: r(k) and q(k). Mathematically, r(k) and q(k) may be described by 
the following equations: 



g #■(*) = r(* + l) Eq.(4) 

4- 

jfjl 5 q(k) = q(k + 1) + C(x) • {p(k) + r{k + 1)} Eq. (5) 

Ly Based on the above-described principle of operation, it may be observed that r(k+l)=x(i-j) when 

" p(k)=x(i+j), and thus, for these input values, q(k) = C(j).[x(i+j)+x(i-j)]. 

p Referring back to Fig. 2, as an example, in some embodiments, the processing units 20 0 , 

Q20 20„ . . ., 20 k , . . . 20 n .„ 20 n are associated with the C(n), C(n-l), . . . C(k), . . . C(l), C(0) 

coefficients, respectively, and the processing chain 10 begins with processing unit 20 n and ends 
with the processing unit 20 0 . Thus, as an example, for a particular output value, the processing 
unit 20n provides the first product (called the C(0) product) by multiplying the x(i) value by 
C(0). It is noted that for j=0, x(i+j)=x(i-j)=x(i). The product that is provided by the processing 
25 unit 20n begins a sum to which all the processing units 20 contribute another product. In this 
manner, the processing unit 20n-l receives signals from the processing unit 20 that indicate the 
C(0).x(i) product. The processing unit 20n-l adds the term C(l)-[x(i+l)+x(i-l)], called the C(l) 
product, to the ongoing sum and furnishes signals to the next processing unit in the chain, etc. 
Eventually, the processing unit 20 0 adds the last product (the C(N)*[(x+N)+(x-N)] product) to the 
30 rolling sum to generate the signal at the output terminals 11. It is noted that when the processing 
chain 10 receives x(0) to begin the filtering, N-H clock cycles are consumed to produce the first 
valid output value. However, thereafter, the processing chain 10 produces an output on every 
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clock cycle, thereby resulting in 100% throughput. 

Referring back to Fig. 3, as an example, in some embodiments, the processing unit 20k 
may include input 22 and output 24 registers that delay the digital signal that indicates each 
r(k+l) value before communicating the signal to the successor processing unit 20k- 1. In this 
5 manner, in some embodiments, the input register 22 receives the digital signal that indicates the 
r(k) signal (via the input lines 104) on a positive edge (for example) of a processing clock signal 
(called CLK,) and communicates the stored digital signal to the output register 24 on the next 
positive edge (as an example) of the CLK! signal. The output register 24 indicates (via the 
output lines 108) the stored digital signal (i.e., indicates the r(k) value) to the successor input 
10 register 22 in the processing chain 10. 

The processing unit 20k also includes an adder 32, a multiplier 34 and an adder 36 to 
generate the q(k) value. In some embodiments, these components are clocked by a clock signal 
(called CLK 2 ) that is synchronized to the CLK! clock signal and has a frequency that is a 
multiple of the frequency of the CLK, clock signal so that the q(k) signal is generated on each 
f^j 15 positive edge (for example) of the CLK! signal. In this manner, the adder 32 is coupled to 
hi receive the digital signal that indicates the r(k+l) value synchronously with the reception of the 

r(k+l) value by the input register 22. The adder 32 adds this digital signal with a digital signal 
f7 that indicates the current p(k) value to form an indication of p(k) + r(k+l). The multiplier 34 

□ multiplies the digital output signal from the adder 32 with a digital signal that indicates the 

* 20 associated filter coefficient to produce the digital signal that indicates the C(k)-[p(k)+r(k+l)] 
^ signal. The digital signal that indicates the filter coefficient is stored in a coefficient register 30. 

The coefficient may be changed via data and control lines 16 that are coupled to the register 30. 
The adder 36 combines the digital output signal from the multiplier 34 with the q(k+l) signal to 
produce the digital output signal (on the output lines 110) that indicates the q(k) value. 
25 Fig. 4 depicts an example of the integration of the processing chain 10 and the tap 

selecting circuit 12 to form a selectable tap filter 199 that permits the selection of up to seven 
taps. In this manner, the filter 199 includes a processing chain of five processing units 20 0 , 20] , 
20 2 , 20 3 and 20 4 . The filter 199 also includes four termination units 100 0 , 100,, 100 2 , and 100 3 (of 
the same design 100) that are associated with the processing units 20 0 , 20,, 20 2 , and 20 3 , 
30 respectively. In this manner, a particular termination unit 100 may be selected (via the 

appropriate bit in a register 200) to terminate the processing chain at its associated processing 



5 




unit 20. For example, the termination unit 100 2 may be selected to terminate the processing chain 
at the processing unit 20 2 and thus, create a five tap processing chain. Similarly, the termination 
unit 100i may be selected to terminate the processing chain at the processing unit 20! and thus, 
create a three tap processing chain. 
5 The selection of a particular termination unit 100 may be accomplished via selection lines 

103, each of which extends to a different termination unit 100. In this manner, when a particular 
selection line 103 is asserted (driven high, for example) the associated termination unit 100 is 
selected and thus, the number of taps is selected. It is noted that only one selection line 103 is 
asserted, and the remaining selection lines 103 are deasserted (driven low, for example). The 
10 selection lines 103 may indicate respective selection bits of a selection register 200, and the 
selection bits may be stored in the register 200 via data and control lines 201 . 

As depicted by the termination unit 100 2 , each termination unit 100 may include a 

%p multiplexer 124 that selects either the broadcast input lines 205 (when the termination unit 100 is 

£ 

selected) or the output lines 108 (when the termination unit 100 is deselected) of the predecessor 
1^15 processing unit 20 and couples the selected lines to the input lines 104. The termination unit 100 



may also include another multiplexer 122 that selects either the output lines 110 (when the 
termination unit 100 is selected) of the previous processing unit 20 or the lines 118 (when the 
termination unit 100 is deselected) indicative of "0" (i.e., a zero sum) and couples the selected 

Q lines to the input lines 102. 

ff I 

: ~20 Referring to Fig. 5, in some embodiments, the processing unit and termination unit may 

yj 

tfl be combined to form a combined unit 300. In this manner, the unit 300 may be replicated to 

form a processing chain of an arbitrary length. This processing chain may be effectively 
truncated as needed to suit a particular filtering application, as described above. 

While the invention has been disclosed with respect to a limited number of embodiments, 
25 those skilled in the art, having the benefit of this disclosure, will appreciate numerous 

modifications and variations therefrom. It is intended that the appended claims cover all such 
modifications and variations as fall within the true spirit and scope of the invention. 
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